Multimedia Classifier

نویسندگان

  • Gabriel Nicolae Costache
  • Inge Gavat
چکیده

Along with the aggressive growing of the amount of digital data available (text, audio samples, digital photos and digital movies joined all in the multimedia domain) the need for classification, recognition and retrieval of this kind of data became very important. In this paper will be presented a system structure to handle multimedia data based on a recognition perspective. The main processing steps realized for the interesting multimedia objects are: first, the parameterization, by analysis, in order to obtain a description based on features, forming the parameter vector; second, a classification, generally with a hierarchical structure to make the necessary decisions. For audio signals, both speech and music, the derived perceptual features are the melcepstral (MFCC) and the perceptual linear predictive (PLP) coefficients. For images, the derived features are the geometric parameters of the speaker mouth. The hierarchical classifier consists generally in a clustering stage, based on the Kohonnen SelfOrganizing Maps (SOM) and a final stage, based on a powerful classification algorithm called Support Vector Machines (SVM). The system, in specific variants, is applied with good results in two tasks: the first, is a bimodal speech recognition which uses features obtained from speech signal fused to features obtained from speaker’s image and the second is a music retrieval from large music database.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Wireless Multimedia Sensor Networks for Collaborative Hybrid Semi-Supervised Classifier Learning

Wireless multimedia sensor networks (WMSN) have recently emerged as one ofthe most important technologies, driven by the powerful multimedia signal acquisition andprocessing abilities. Target classification is an important research issue addressed in WMSN,which has strict requirement in robustness, quickness and accuracy. This paper proposes acollaborative semi-supervised classifier learning al...

متن کامل

Human activity recognition by combining discriminative and generative classifiers

In this article, we propose a novel algorithm for the recognition of complex activities in multimedia streams. The algorithm consists of a discriminative feature classifier based on random forests and a generative classifier, for which we use the hierarchical hidden Markov model. The discriminative feature classifier checks the existence or absence of the steps required for the execution of an ...

متن کامل

Semantic Adaptation of Neural Network Classifiers in Image Segmentation

Semantic analysis of multimedia content is an on going research area that has gained a lot of attention over the last few years. Additionally, machine learning techniques are widely used for multimedia analysis with great success. This work presents a combined approach to semantic adaptation of neural network classifiers in multimedia framework. It is based on a fuzzy reasoning engine which is ...

متن کامل

A Review of Image Classification Techniques in Content Based Image Retrieval

As the growth and development of various multimedia technologies in the field of CBIR many advanced information retrieval systems have become popular and has brought the new evolution in fast and effective retrieval. In this paper the techniques of image classification in CBIR are been discussed and compared. It also introduces classifiers like support vector machine, Bayesian classifier for ac...

متن کامل

A General Framework for Classifier Adaptation and its Applications in Multimedia

For the analysis and retrieval of multimedia data, machine learning techniques have been extensively applied to build models that map various feature vectors of the data into semantic labels. As multimedia data come from a wide variety of domains (e.g., genres, sources), each having its distinctive data characteristics, models trained from one domain do not usually generalize well to other doma...

متن کامل

Distributed Classifier Chain Optimization for Real-time Multimedia Stream Mining Systems

We consider the problem of optimally configuring classifier chains for real-time multimedia stream mining systems. Jointly maximizing the performance over several classifiers under minimal end-to-end processing delay is a difficult task due to the distributed nature of analytics (e.g. utilized models or stored data sets), where changing the filtering process at a single classifier can have an u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004